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Related Application 

The subject matter of this application is related to the subject matter in a 
co-pending non-provisional application by the same inventors as the instant 
application entitled, " Solving Systems Of Nonlinear Equations Using Interval 
Arithmetic And Term Consistency", having serial number 09/991,477, and a filing 
date of 16 November 2001 (Attorney Docket No. SUN-P6429-SPL). 

BACKGROUND 

Field of the Invention 

The present invention relates to performing arithmetic operations on 
interval operands within a computer system. More specifically, the present 

1 

Attorney Docket No. SUN-P7075-SPL Inventors: Walster et al. 

ARPC \MY DOCUMENTSVSUN MICROSYSTEMS\SUN-P7075-SPL\SUN-P7075-SPL APPLICATION DOC 



invention relates to a method and an apparatus for using a computer system to 
solve a system of nonlinear equations with interval arithmetic. 



Related Art 

Rapid advances in computing technology make it possible to perform 
trillions of computational operations each second. This tremendous 
computational speed makes it practical to perform computationally intensive tasks 
as diverse as predicting the weather and optimizing the design of an aircraft 
engine. Such computational tasks are typically performed using machine- 
representable floating-point numbers to approximate values of real numbers. (For 
example, see the Institute of Electrical and Electronics Engineers (IEEE) standard 
754 for binary floating-point numbers.) 

In spite of their limitations, floating-point numbers are generally used to 
perform most computational tasks. 

One limitation is that machine-representable floating-point numbers have a 
fixed-size word length, which limits their accuracy. Note that a floating-point 
number is typically encoded using a 32, 64 or 128-bit binary number, which 
means that there are only 2 32 , 2 64 or 2 128 possible symbols that can be used to 
specify a floating-point number. Hence, most real number values can only be 
approximated with a corresponding floating-point number. This creates 
estimation errors that can be magnified through even a few computations, thereby 
adversely affecting the accuracy of a computation. 

A related limitation is that floating-point numbers contain no information 
about their accuracy. Most measured data values include some amount of error 
that arises from the measurement process itself. This error can often be quantified 
as an accuracy parameter, which can subsequently be used to determine the 
accuracy of a computation. However, floating-point numbers are not designed to 
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keep track of accuracy information, whether from input data measurement errors 
or machine rounding errors. Hence, it is not possible to determine the accuracy of 
a computation by merely examining the floating-point number that results from 
the computation. 

Interval arithmetic has been developed to solve the above-described 
problems. Interval arithmetic represents numbers as intervals specified by a first 
(left) endpoint and a second (right) endpoint For example, the interval [a, b], 
where a < b, is a closed, bounded subset of the real numbers, i?, which includes a 
and b as well as all real numbers between a and b. Arithmetic operations on 
interval operands (interval arithmetic) are defined so that interval results always 
contain the entire set of possible values. The result is a mathematical system for 
rigorously bounding numerical errors from all sources, including measurement 
data errors, machine rounding errors and their interactions. (Note that the first 
endpoint normally contains the "infimum", which is the largest number that is less 
than or equal to each of a given set of real numbers. Similarly, the second 
endpoint normally contains the "supremum" which is the smallest number that is 
greater than or equal to each of the given set of real numbers.) 

One commonly performed computational operation is to find the roots of a 
nonlinear equation. This can be accomplished using Newton's method. The 
interval version of Newton's method works in the following manner. From the 
mean value theorem, 

where £ is some generally unknown point between x and x* If x* is a zero off, 
then/(**) = 0 and, from the previous equation, 
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Let X be an interval containing both x and x * Since £ is between x and x * it 
follows that £ eX. Moreover, from basic properties of interval analysis it follows 
5 that/ '( 0 ef \X). Hence, x * e N(x,X) where 

N(xJQ=x-fixW'UQ. 

Temporarily assume 0 £ fX%) so that JV(x,Jl) is a finite interval. Since any zero of 
10 /inXis also in N(x,X) 9 the zero is in the intersection X nN{x,X). Using this fact, 
we define an algorithm for finding x* Let X 0 be an interval containing x* For n 
= 0, 7, 2, define 

x /? = m{X n ) 

15 Mx^)=x„-/(x«)//X^) 
X w+7 =X„nN(x„,X n ), 

wherein m(X) is the midpoint of the interval X, and the notation f\x n ) makes 
explicit the fact that evaluating/ at the point x n must be done using interval 

20 arithmetic. We call x n the point of expansion for the Newton method. It is not 
necessary to choose x n to be the midpoint of X„. The only requirement is that 
x n e X n to assure that x* e N(x n , X n ). However, it is convenient and efficient to 
choose x» = m(X n ). Note that the roots of an interval equation can be intervals 
rather than points when the equation contains non-degenerate interval constants or 

25 parameters. 

The interval version of Newton's algorithm for finding roots of nonlinear 
equations is designed to work best "in the small" when nonlinear equations are 
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approximately linear. For large intervals containing multiple roots, the interval 
Newton algorithm splits the given interval into two sub-intervals that are then 
processed independently. By this mechanism all the roots of a nonlinear equation 
can be found. 

One problem is applying the multivariate generalization of the interval 
Newton algorithm to large rc-dimensional interval vectors (or boxes) that contain 
multiple roots. In this case, the process of splitting in ^-dimensions can lead to 
exponential growth in the number of boxes to process. 

It is well known that this problem (and even the problem of computing 
"sharp" bounds on the range of a function of w-variables over an ^-dimensional 
box) is an "NP-hard" problem. In general, NP-hard problems require an 
exponentially increasing amount of work to solve as n 9 the number of independent 
variables, increases. 

Because NP-hardness is a worst-case property and because many practical 
engineering and scientific problems have relatively simple structure, one problem 
is to use this simple structure of real problems to improve the efficiency of 
interval nonlinear equation solving algorithms. 

Hence, what is needed is a method and an apparatus for using the structure 
of nonlinear equations to improve the efficiency of interval root-bounding 
software. To this end, what is needed is a method and apparatus that efficiently 
deletes boxes or parts of "large" boxes that the interval Newton algorithm can 
only split. 

SUMMARY 

The present invention combines various methods to speed up the solution 
of systems of nonlinear equations. The combined methods use the structure of 
nonlinear equations to efficiently delete parts of all of large sub-boxes that would 
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otherwise have to be split. One embodiment of the present invention provides a 
computer-based system for solving a system of nonlinear equations specified by a 
vector function, f f wherein f(x) = 0 represents fj(x) = 0,f 2 (x) = 0,f 3 (x) = 0, . . 
f n (x) = 0, and wherein x is a vector (x J} x 2) x 3) ... x n ). The system operates by 
receiving a representation of a subbox X = (X h X 2) X n \ wherein for each 
dimension, i 9 the representation of X t includes a first floating-point number, a l9 
representing the left endpoint of X l} and a second floating-point number, b h 
representing the right endpoint of X x . For each nonlinear equation /(x) = 0 in the 
system of equations f(x) = 0, each individual component function ffa) = 0 can be 
written in the form g(x )) = h(x% where g can be analytically inverted so that an 
explicit expression for x ) can be obtained: x ) =g'\h(x)). 

For each component function f t there are different ways to analytically 
solve for a component Xj of the vector x. For each of these rearrangements, if a 
given interval box X is used as an argument of h, and if X/ = Xj n X' J9 where 
X) = g l {h{X)\ then the new interval X } for the >th component of X is guaranteed 
to be at least as narrow as the original X r 

This process is then be iterated for different terms g of each component 
function f x . After reducing any element Xj of the box X to X/, the reduced value 
can be used in X thereafter to speed up the reduction process using other 
component functions and terms thereof. 

Hereafter, the notation g(Xj) for a term of a component function f t (x) 
implicitly represents any term of any component function. This eliminates the 
need for additional subscripts that do not add clarity to the exposition. 

One embodiment of the present invention provides a system that receives a 
representation of a subbox X and then stores the representation in a computer 
memory. Next, the system applies term consistency to the set of nonlinear 
equations, fj(x) = 0, f 2 (x) = 0,f 3 (x) = 0, ... ?J /«(x) = 0, over X, and excludes 



6 

Attorney Docket No. SUN-P7075-SPL 

ARPH \SUN MICROSYSTEMS\SUN-P7075-SPL\SUN-P7075-SPL APPLICATION DOC 



Inventors: Walster et al. 



portions of X that violate the set of nonlinear equations. The system also applies 
box consistency to the set of nonlinear equations over X, and excludes portions of 
X that violate the set of nonlinear equations. Finally, the system performs an 
interval Newton step on X to produce a resulting subbox Y, wherein the point of 
expansion of the interval Newton step is a point x within X, and wherein 
performing the interval Newton step involves evaluating f(x) using interval 
arithmetic to produce an interval result f r (x). 

In a variation on this embodiment, while performing the interval Newton 
step, the system computes J(x,X), wherein J(x,X) is the Jacobian of the gradient 
of the function f evaluated as a function of x over the subbox X. The system then 
computes a preconditioned Jacobian, M(x,X) = B J(x,X), wherein B is an 
approximate inverse of the center of J(x,X). Next, the system uses the Hull 
method to solve for the resulting subbox Y that contains the values of y that 
satisfy M(x,X)(y-x) = r(x), where r(x) = -Bf(x). In the process, M(x,X) and 
J(x,X) are proved to be regular. An interval matrix is regular if it contains no 
singular matrices. 

In a variation on this embodiment, the system applies term consistency to 
the preconditioned set of nonlinear equations Bf(x) = 0 over the subbox X, and 
excludes portions of X that violate the preconditioned set of nonlinear equations. 

In a variation on this embodiment, the system applies box consistency to 
the preconditioned set of nonlinear equations Bf(x) = 0 over the subbox X, and 
excludes portions of X that violate the preconditioned set of nonlinear equations. 

In a variation on this embodiment, the system evaluates a first termination 
condition. This first termination condition is TRUE if zero is contained within 
f *(x), J(x,X) is regular, and Y is contained within X. If the first termination 
condition is TRUE, the system terminates and records X= X n Y as a final bound. 
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In further variation, the system evaluates a second termination condition. 
This second termination condition is TRUE if a function of the width of the 
subbox X is less than a pre-specified value, s x , and the width of the function f 
over the subbox X is less than a pre-specified value, s F . If the second termination 
condition is TRUE, the system terminates and records X as the final bound. 

BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 illustrates a computer system in accordance with an embodiment of 
the present invention. 

FIG. 2 illustrates the process of compiling and using code for interval 
computations in accordance with an embodiment of the present invention. 

FIG. 3 illustrates an arithmetic unit for interval computations in 
accordance with an embodiment of the present invention. 

FIG. 4 is a flow chart illustrating the process of performing an interval 
computation in accordance with an embodiment of the present invention. 

FIG. 5 illustrates four different interval operations in accordance with an 
embodiment of the present invention. 

FIG. 6 illustrates a process of solving for zeros of a vector function that 
specifies a system of nonlinear equations using the interval Newton method. 

FIG. 7 illustrates another process of solving for zeros of a function that 
specifies a system of nonlinear equations using the interval Newton method in 
accordance with an embodiment of the present invention. 

FIG. 8 is a flow chart illustrating the process of finding an interval solution 
to a single nonlinear equation in accordance with an embodiment of the present 
invention. 

FIG. 9A is portion of a flow chart illustrating the process of finding an 
interval solution to a system of nonlinear equations in combination with the 
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interval Newton method for finding roots of nonlinear functions in accordance 
with an embodiment of the present invention. 

FIG. 9B is another portion of a flow chart illustrating the process of 
finding an interval solution to a system of nonlinear equations in accordance with 
an embodiment of the present invention. 

DETAILED DESCRIPTION 

The following description is presented to enable any person skilled in the 
art to make and use the invention, and is provided in the context of a particular 
application and its requirements. Various modifications to the disclosed 
embodiments will be readily apparent to those skilled in the art, and the general 
principles defined herein may be applied to other embodiments and applications 
without departing from the spirit and scope of the present invention. Thus, the 
present invention is not limited to the embodiments shown, but is to be accorded 
the widest scope consistent with the principles and features disclosed herein. 

The data structures and code described in this detailed description are 
typically stored on a computer readable storage medium, which may be any device 
or medium that can store code and/or data for use by a computer system. This 
includes, but is not limited to, magnetic and optical storage devices such as disk 
drives, magnetic tape, CDs (compact discs) and DVDs (digital versatile discs or 
digital video discs), and computer instruction signals embodied in a transmission 
medium (with or without a carrier wave upon which the signals are modulated). 
For example, the transmission medium may include a communications network, 
such as the Internet. 
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Computer System 

FIG. 1 illustrates a computer system 100 in accordance with an 
embodiment of the present invention. As illustrated in FIG. 1, computer system 
100 includes processor 102, which is coupled to a memory 1 12 and a to peripheral 
bus 1 10 through bridge 106. Bridge 106 can generally include any type of 
circuitry for coupling components of computer system 100 together. 

Processor 102 can include any type of processor, including, but not limited 
to, a microprocessor, a mainframe computer, a digital signal processor, a personal 
organizer, a device controller and a computational engine within an appliance. 
Processor 102 includes an arithmetic unit 104, which is capable of performing 
computational operations using floating-point numbers. 

Processor 102 communicates with storage device 108 through bridge 106 
and peripheral bus 110. Storage device 108 can include any type of non-volatile 
storage device that can be coupled to a computer system. This includes, but is not 
limited to, magnetic, optical, and magneto-optical storage devices, as well as 
storage devices based on flash memory and/or battery-backed up memory. 

Processor 102 communicates with memory 1 12 through bridge 106. 
Memory 1 12 can include any type of memory that can store code and data for 
execution by processor 102. As illustrated in FIG. 1, memory 1 12 contains 
computational code for intervals 1 14. Computational code 1 14 contains 
instructions for the interval operations to be performed on individual operands, or 
interval values 115, which are also stored within memory 112. This 
computational code 1 14 and these interval values 1 15 are described in more detail 
below with reference to FIGs. 2-5. 

Note that although the present invention is described in the context of 
computer system 100 illustrated in FIG. 1, the present invention can generally 
operate on any type of computing device that can perform computations involving 
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floating-point numbers. Hence, the present invention is not limited to the 
computer system 100 illustrated in FIG. 1. 

Compiling and Using Interval Code 

FIG. 2 illustrates the process of compiling and using code for interval 
computations in accordance with an embodiment of the present invention. The 
system starts with source code 202, which specifies a number of computational 
operations involving intervals. Source code 202 passes through compiler 204, 
which converts source code 202 into executable code form 206 for interval 
computations. Processor 102 retrieves executable code 206 and uses it to control 
the operation of arithmetic unit 104. 

Processor 102 also retrieves interval values 115 from memory 1 12 and 
passes these interval values 1 1 5 through arithmetic unit 104 to produce results 
212. Results 212 can also include interval values. 

Note that the term "compilation" as used in this specification is to be 
construed broadly to include pre-compilation and just-in-time compilation, as well 
as use of an interpreter that interprets instructions at run-time. Hence, the term 
"compiler" as used in the specification and the claims refers to pre-compilers, 
just-in-time compilers and interpreters. 

Arithmetic Unit for Intervals 

FIG. 3 illustrates arithmetic unit 104 for interval computations in more 
detail accordance with an embodiment of the present invention. Details regarding 
the construction of such an arithmetic unit are well known in the art. For 
example, see U.S. Patent Nos. 5,687,106 and 6,044,454, which are hereby 
incorporated by reference in order to provide details on the construction of such 
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an arithmetic unit. Arithmetic unit 104 receives intervals 302 and 312 as inputs 
and produces interval 322 as an output. 

In the embodiment illustrated in FIG. 3, interval 302 includes a first 
floating-point number 304 representing a first endpoint of interval 302, and a 
second floating-point number 306 representing a second endpoint of interval 302. 
Similarly, interval 312 includes a first floating-point number 314 representing a 
first endpoint of interval 312, and a second floating-point number 316 
representing a second endpoint of interval 312. Also, the resulting interval 322 
includes a first floating-point number 324 representing a first endpoint of interval 
322, and a second floating-point number 326 representing a second endpoint of 
interval 322. 

Note that arithmetic unit 104 includes circuitry for performing the interval 
operations that are outlined in FIG. 5. This circuitry enables the interval 
operations to be performed efficiently. 

However, note that the present invention can also be applied to computing 
devices that do not include special-purpose hardware for performing interval 
operations. In such computing devices, compiler 204 converts interval operations 
into a executable code that can be executed using standard computational 
hardware that is not specially designed for interval operations. 

FIG. 4 is a flow chart illustrating the process of performing an interval 
computation in accordance with an embodiment of the present invention. The 
system starts by receiving a representation of an interval, such as first floating- 
point number 304 and second floating-point number 306 (step 402). Next, the 
system performs an arithmetic operation using the representation of the interval to 
produce a result (step 404). The possibilities for this arithmetic operation are 
described in more detail below with reference to FIG. 5. 
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Interval Operations 

FIG. 5 illustrates four different interval operations in accordance with an 
embodiment of the present invention. These interval operations operate on the 
intervals X and Y. The interval X includes two endpoints, 

x denotes the lower bound of X, and 
x denotes the upper bound of X. 

The interval X is a closed subset of the extended (including -°° and +00) real 
numbers R* (see line 1 of FIG. 5). Similarly the interval Falso has two endpoints 
and is a closed subset of the extended real numbers R* (see line 2 of FIG. 5). 

Note that an interval is a point or degenerate interval if X- [x, x]. Also 
note that the left endpoint of an interior interval is always less than or equal to the 
right endpoint. The set of extended real numbers, R* is the set of real numbers, R, 
extended with the two ideal points negative infinity and positive infinity: 

i?* = i?U {-00} u {+00}. 

In the equations that appear in FIG. 5, the up arrows and down arrows 
indicate the direction of rounding in the next and subsequent operations. Directed 
rounding (up or down) is applied if the result of a floating-point operation is not 
machine-representable. 

The addition operation X+ Y adds the left endpoint of Xto the left 
endpoint of Y and rounds down to the nearest floating-point number to produce a 
resulting left endpoint, and adds the right endpoint of X\o the right endpoint of Y 
and rounds up to the nearest floating-point number to produce a resulting right 
endpoint. 
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Similarly, the subtraction operation X- 7 subtracts the right endpoint of 7 
from the left endpoint of X and rounds down to produce a resulting left endpoint, 
and subtracts the left endpoint of Y from the right endpoint of X and rounds up to 
produce a resulting right endpoint. 
5 The multiplication operation selects the minimum value of four different 

terms (rounded down) to produce the resulting left endpoint. These terms are: the 
left endpoint of X multiplied by the left endpoint of Y; the left endpoint of X 
multiplied by the right endpoint of 7; the right endpoint of X multiplied by the left 
endpoint of 7; and the right endpoint of X multiplied by the right endpoint of Y. 
10 This multiplication operation additionally selects the maximum of the same four 
terms (rounded up) to produce the resulting right endpoint. 

Similarly, the division operation selects the minimum of four different 
terms (rounded down) to produce the resulting left endpoint. These terms are: the 
left endpoint of X divided by the left endpoint of Y; the left endpoint of X divided 
1 5 by the right endpoint of Y; the right endpoint of X divided by the left endpoint of 
00 7; and the right endpoint of X divided by the right endpoint of 7 This division 

operation additionally selects the maximum of the same four terms (rounded up) 
to produce the resulting right endpoint. For the special case where the interval 7 
includes zero, X/7is an exterior interval that is nevertheless contained in the 
20 interval 1?*. 

Note that the result of any of these interval operations is the empty interval 
if either of the intervals, X or 7, are the empty interval. Also note, that in one 
embodiment of the present invention, extended interval operations never cause 
undefined outcomes, which are referred to as "exceptions" in the IEEE 754 
25 standard. 



c 
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Interval Version of Newton's Method for Systems of Nonlinear Equations 

FIG. 6 illustrates the process of using the interval Newton method without 
the present invention to solve for zeros of a vector function that defines a system 
of nonlinear equations. The system starts with a multi-variable function f(x) = 0, 
5 wherein x is a vector (*/, x 2 , x 3) ... x n ), and wherein f(x) = 0 represents a system of 
equations //(x) = 0,/ 2 (x) = 0,/ 3 (x) = 0, ...,/«(x) = 0. 

Next, the system receives a representation of a box X (step 602). In one 
embodiment of the present invention, for each dimension, /, the representation of 
X t includes a first floating-point number, a u representing the left endpoint of X, in 
10 the z-th dimension, and a second floating-point number, b h representing the right 
endpoint of X t . 

H The system then performs a Newton step on X, wherein the point of 

"!f expansion is typically x = m(X) the midpoint of the box X, to compute a resulting 

& 

!U interval X'= X n N(x,X) (step 604). 

q 1 5 Next, the system evaluates termination criteria A and B, which relate to the 

^ size of the box X' and the function f } respectively (step 606). Criterion A is 

p satisfied if the width of the interval X', w(X'), is less than e x for some user- 

p specified e x > 0, wherein w(X') is be defined as the maximum width of any 

p component^ of the interval X'. Note that s x is user-specified and is an absolute 

20 criterion. Criterion A can alternatively be a relative criterion w(X')/|X' | < s x if the 
box X' does not contain zero. Moreover, s x can be a vector, e x , wherein there 
exists a separate component s Xl for each dimension in the box X'. In this case, 
components containing zero can use absolute criteria, while other components use 
relative criteria. 

25 Criterion B is satisfied if | |f 1 1 < ^for some user-specified s F > 0, wherein 

||fl|=max(^(X , )| J L6(X')U/ 5 (X')|, ...,|/*(X')|). Note that as with s Xf element- 
specific values Sn can be used, but they are always absolute. 
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However they are defined, if criteria^ and B are satisfied, the system 
terminates and accepts X' as the final bounding box for the zeros of f (step 610). 
Otherwise, if either criterion A or criterion B is not satisfied, the system proceeds 
to evaluate criterion C (step 612). 

5 Criterion C is satisfied if three conditions are satisfied. A first condition is 

satisfied if zero is contained within f ! (x), wherein x is a point within the box X, 
and wherein f \x) is a box that results from evaluating f(x) using interval 
arithmetic. Note that performing the interval Newton step in step 604 involves 
evaluating f(x) to produce an interval result f T (x). Hence, f *(x) does not have to 

1 0 be recomputed for criterion C. 

A second condition is satisfied if M(x,X) = BJ(x,X) is regular. J(x,X) is 
the Jacobian (matrix of first order partial derivatives) of the vector function f as a 
function of the point x in the interval X. B is an approximate inverse of the center 
of J(x,X). Note that multiplying J(x,X) by B preconditions J(x,X) so it is easier 

15 to determine whether J(x,X) is regular. Hence, M(x,X) is referred to as the 
"preconditioned" Jacobian. 

Finally, a third condition is satisfied if X = X\ This indicates that the 
interval Newton step (in step 604) failed to make progress. 

If criterion C is satisfied, the system terminates and accepts X' as a final 

20 bounding box for the zeros off (step 616). 

Otherwise, if criterion C is not satisfied, the system returns for another 
iteration. This may involve splitting X' into multiple intervals to be separately 
solved if the Newton step has not made sufficient progress to assure convergence 
at a reasonable rate (step 618). The system then sets X=X' (step 620) and returns 

25 to step 604 to perform another interval Newton step. 
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The above-described process works well if tolerances sx and s F are chosen 
"relatively large". In this case, processing stops early and computing effort is 
relatively small. 

Alternatively, the process illustrated in FIG. 7 seeks to produce to best (or 
near best) possible result for simple zeros, again without the present invention. 
The system starts with a multi-variable function f(x) = 0. Next, the system 
receives a representation of a box X (step 702). The system then performs a 
Newton step on X, wherein the point of expansion is x e X, to compute a 
resulting interval X'= X n N(x,X) (step 704). 

Next, the system evaluates criterion C (step 705). If criterion C is 
satisfied, the system terminates and accepts X ' as a final bounding box for the 
zeros of f (step 708). Otherwise, if criterion C is not satisfied, the system 
proceeds to determine if M(x,X) is regular (step 709). 

If M(x,X) is regular, the system returns for another iteration. This may 
involve splitting X' into multiple intervals to be separately solved if the Newton 
step has not made sufficient progress to assure convergence at a reasonable rate 
(step 717). The system also sets X=X' (step 718) before returning to step 704 to 
perform another interval Newton step. 

If M(x,X) is not regular, the system evaluates termination criteria^ and B 
(step 712). If criteria A and B are satisfied, the system terminates and accepts X' 
as a final bounding box for the zeros of f (step 716). Otherwise, if either criterion 
A or criterion B is not satisfied, the system returns for another iteration (steps 717 
and 718). 

Term Consistency 

FIG. 8 is a flow chart illustrating the process of solving a single nonlinear 
equation through interval arithmetic and term consistency in accordance with an 
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embodiment of the present invention. This is the process designed to work well 
for large boxes X when otherwise the interval Newton method must repeatedly 
split the given box into sufficiently small subboxes. The system starts by 
receiving a representation of a nonlinear equation fix) = 0 (step 802), as well as a 
5 representation of an initial box X (step 804). Next, the system symbolically 
manipulates the equation/x) = 0 to solve for a term g(xj) = h(x), wherein the 
term g(xj) can be analytically inverted to produce the inverse function g 1 (step 
806). 

Next, the system substitutes the initial box X into h(X) to produce the 
1 0 equation g{X)) = h(X) (step 808). The system then solves for X) = g\h(X)) 

(step 810). The resulting interval X) is then intersected with the initial interval X } 

to produce a new interval X + j (step 812). 

At this point, the system can terminate. Otherwise, the system can 

perform further processing. This further processing involves setting X 3 = Xj + in X 
1 5 (step 8 14) and then either returning to step 806 for another iteration of term 

consistency on another term of fix), or by performing an interval Newton step on 

fix) and X to produce a new interval X + (step 816). At this point, if stopping 

criteria are satisfied (step 817), the system possibly splits the box (step 818) and 

returns to step 806. 

20 

Examples of Applying Term Consistency 

For example, suppose/^) -x 2 -x + 6. We can define g{x) = x 2 and 
fc(x)=jc-tf. Let J*T= [-10,10], The procedural step is (X') 2 = X-6=[- 16,4]. Since 
(X') 2 must be non-negative, we replace this interval by [0,4]. Solving forX', we 
25 obtain X' = ± [0,2]. In replacing the range of h(x) (i.e., [-16,4]) by non-negative 
values, we have excluded that part of the range h(x) that is not in the domain of 
g(x) =x 2 . 
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1 i 

Suppose that we reverse the roles of g and A and use the iterative step 
KX*) = gVO. That is X'- 6 = X 2 We obtain JT - [6,106]. Intersecting this 
result with the interval [-10,10], of interest, we obtain [6,10]. This interval 
excludes the set of values for which the range of g(X) is not in the intersection of 
5 the domain of h(X) with X. 

Combining these results, we conclude that any solution of 
fiX) = g(X) -h(X) = 0 that occurs inX= [-10,10] must be in both [-2,2] and 
[6,10]. Since these intervals are disjoint, there can be no solution in [-10,10]. 

In practice, if we already reduced the interval from [-10,10] to [-2,2] by 
10 solving for g, we use the narrower interval as input when solving for h. 

This example illustrates the fact that it can be advantageous to solve a 

jjsss; 

p given equation for more than one of its terms. The order in which terms are 

chosen affects the efficiency. Unfortunately, it is not known how best to choose 
tu the best order. 

Q 1 5 Also note that there can be many choices for g(x). For example, suppose 

^ we use term consistency to narrow the interval bound X on a solution of 

fx) = ax 4 + bx + c = 0, We can let g(x) = bx and compute X' = -(aX 4 + c)/b or 

0 we can let g(x) = ax 4 and compute X* = ± [-(bX+c)/a] 1/4 We can also separate x 4 
}2 into x 2 * x 2 and solve for one of the factors X' = ± [-(bX+c)/(aX 2 )] 1/2 . 

1 y 20 In the multidimensional case, we may solve for a term involving more than 

one variable. We then have a two-stage process. For example, suppose we solve 
for the term l/(x+y) from the fonctionyfoj/) = l/(x+y) - h(x,y) = 0. Let 
xeX= [1,2] and^ e F= [0.5,2]. Suppose we find that h(X,Y) = [0.5,1]. Then 
l/(x+y) e [0.5,1] so x+y e [1,2]. Now we replace^ by Y= [0.5,2] and obtain the 
25 bound [-1,1.5] onX. Intersecting this interval with the given bound X= [1,2] on 
x, we obtain the new bound X' = [ 1 , 1 ,5] . 
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We can use JT to get a new bound on h; but this may require extensive 
computing if h is a complicated function; so suppose we do not. Suppose that we 
do, however, use this bound on our intermediate result x + y = [1,2]. Solving for y 
as [1,2] ~X\ we obtain the bound [-0.5,1]. Intersecting this interval with 7, we 
obtain the new bound Y' = [0.5,1] ony. Thus, we improve the bounds on both x 
andy by solving for a single term of / 

The point of these examples is to show that term consistency can be used 
in many ways both alone and in combination with the interval Newton algorithm 
to improve the efficiency with which roots of a single nonlinear equation within a 
large box can be computed. The same is true for systems of nonlinear equations. 

Solving a System of Nonlinear Equations 

FIGs. 9A and 9B present a flow chart illustrating the process of finding an 
interval solution to a system of nonlinear equations using term consistency in 
combination with the interval Newton method for finding roots of nonlinear 
functions in accordance with an embodiment of the present invention. 

We assume that an initial box X (0) is given. We seek all zeros of the 
function f in this box. However, note that more than one box can be given. As 
the process proceeds, it usually generates various sub-boxes of X (0) . These sub- 
boxes are stored in a list L waiting to be processed. At any given time, the list L 
may be empty or may contain several (or many) boxes. 

The steps of the process are generally performed in the order given below 
except as indicated by branching. The current box is denoted by X even though it 
changes from step to step. We assume the tolerances s x and s F are given by the 
user. 
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The system first puts the initial box(es) in the list L (step 900), If the list L 
is empty, the system stops. Otherwise, the system selects the box most recently 
put in L to be the current box, and deletes it from L (step 901). 

Next, for future reference, the system stores a copy of the current box X. 
This copy is referred to as X (1) . If term consistency previously has been applied n 
times in succession without applying a Newton step, the system goes to step 908 
to do so. (In making this count, the system ignores any applications of box 
consistency.) Otherwise, the system applies term consistency to the equation 
ft(j) = 0 (/=i, n) for each variable x 3 (j=l,...,ri). To do so, the system cycles 
through both equations and variables, updating X whenever the width of an 
element^ is reduced. If any result is empty, the system goes to step 901 
(step 902). 

If X satisfies both Criteria^ and B discussed with reference to X' in FIG. 6 
above, the system records X as a solution and goes to step 901 (step 903). 

If the box X (1) was "sufficiently reduced" in step 902, the system repeats 
step 902 (step 904), Note that we need a criterion to test when a box is 
"sufficiently reduced" in size during a step or steps of our process. The purpose 
of the criterion is to enable us to decide when it is not necessary to split a box, 
thereby allowing any procedure that sufficiently reduces the box to be repeated. 
Let X denote a box to which the process is applied in a given step. Assume that X 
is not entirely deleted by the process. Then either X or a sub-box, say X', of X is 
returned. It may have been determined that a gap can be removed from one or 
more components of X'. If so, we may choose to split X' by removing the gap(s). 
For now, assume we ignore gaps. 

We avoid these difficulties by requiring that for some i=l,.„,n 9 we have 
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w(X t ) - w(Xj *) < yw(X) for some constant y where 0<y<l. This assumes that at 
least one component of X is reduced in width by an amount related to the widest 
component of X. We choose y=0.25. 

Thus, we define D=0.25w(X)-max ( u li „ ) (w(JSQ - w{Xi ')). We say that X is 
5 "sufficiently reduced" if D< 0. The system also unconditionally sets X = X\ 
Next, the system applies box consistency to the equation 
fix) = 0 (/=/, n) for each variable xj (j=l } . ..,«). (Box consistency is described 
in "Global Optimization Using Interval Analysis" by Eldon R. Hansen, Marcel 
Dekker, Inc., 1992.) If any result is empty, the system goes to step 901 (step 905). 
10 The system next repeats step 903 (step 906). 

If the current box X is a sufficiently reduced version of the box X (1) defined in 
step 902, the system goes to step 902 (step 907), 

If X is contained in a box X (2) to which the Newton method has been applied 
in step 91 1 below, the systems goes to step 909. Otherwise, the system goes to step 
15 910 (step 908). 

When the Newton method is applied to X (2) in step 911 below, a 
preconditioning matrix B and a preconditioned Jacobian M are computed. If M is 
regular, the system uses B in an "inner iteration procedure" to get an approximate 
solution x of f = 0 in X (step 909). (see Hansen, E. R. and Greenberg, R. L, 1983, An 
20 interval Newton method, Appl. Math. Comput. 12, 89-98.) Otherwise, if M is not 
regular, the system sets x = m(X). 

Next, the system computes J(x,X) using either an expansion in which 
some of the arguments are replaced by real (non-interval) quantities (see Hansen, 
E. R., 1968, On solving systems of equations using interval arithmetic, Math. 
25 Comput. 22, 374-384.) or else using slopes. If the point x was obtained in step 
909, the system uses it as the point of expansion. The system also computes an 
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approximation J c for the center of J(x,X) and an approximate inverse B of J c . 
The system additionally computes M(x,X) = BJ(x,X) and r(x) = -Bf(x) (step 910). 

Note that the system computes B using a "special procedure" so that a 
result B is computed even if J c is singular. When using the Gauss-Seidel 
method, we can regard preconditioning as an effort to compute a matrix that is 
diagonally dominant. Even when A c is singular, we can generally compute a 
matrix B such that some diagonal elements dominate the off-diagonal elements in 
their row. This improves the performance of Gauss-Seidel. To compute B when 
A is singular, we can begin the Gaussian elimination procedure. When a pivot 
element is zero, we replace it by some small quantity and proceed. This enables 
us to complete the elimination procedure and compute a result B. A similar step 
can be used if a triangular factorization method is used to try to invert A c . 

For further reference, the system saves X as X (2) . At this point, if M(x,X) 
is regular, the system computes the hull of the set of points y satisfying 
M(x,X)(y-x) = r(x) (see Hansen, E. R., 1988, Bounding the solution of interval 
linear equations, SIAM J. Numer. Anal, 28, 1493-1503). If M(x,X) is irregular, 
the system computes a bound on this set using one step of the Gauss-Seidel 
method. In either case, the system denotes the resulting box by Y. If Y n X is 
empty, the system goes to step 901 . Otherwise, the system replaces X by Y n X. 
(step 911). 

Next, the system repeats step 903 (step 912). 

If the Gauss-Seidel method was used in step 911, and if the box was 
sufficiently reduced, the system returns to step 911 (step 913). 

If Criterion C discussed with reference to FIG. 6 above is satisfied, the system 
records X as a solution and goes to step 901 (step 914). 

Using B from step 910, the system next determines the analytically 
preconditioned function Bf(x) (see Hansen, E. R., 1992, Preconditioning 
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linearized equations, Computing, 58, 187-196). The system then applies term 
consistency to solve the z-th equation of Bf(x) = 0 to bound x } for (/=1 ,...,«). If 
any result is empty, the system goes to step 901 (step 915). 
The system then repeats step 903 (step 916). 

Next, the system applies box consistency to solve the fth equation of the 
analytically preconditioned system Bf(x)=0 to bound Xj for (j=l ,...,«). If any result 
is empty, the system goes to step 901 (step 917). 

Next, the system repeats step 903 (step 918). 

If the Newton method in step 91 1 reduced the width of the box X (2) by a 
factor of eight or more, the system goes to step 908 (step 919). 

If X is a sufficiently reduced version of the box X (1) , the system goes to step 
901 (step 920). 

Otherwise, the system splits X and then proceeds to step 901 (step 921). 

Note that after termination in step 901, bounds on all solutions of f(x) in the 
initial box X (0) have been recorded. A bounding boxXrecorded in step 903 satisfies 
the conditions w(X) < s x and width(f(x)) < specified by the user. A box X 
recorded in step 913 approximates the best possible bounds that can be computed 
with the number system used. 

The foregoing descriptions of embodiments of the present invention have 
been presented only for purposes of illustration and description. They are not 
intended to be exhaustive or to limit the present invention to the forms disclosed. 
Accordingly, many modifications and variations will be apparent to practitioners 
skilled in the art. For example, although the present invention describes the use of 
derivatives in certain situations, it is often possible to use slopes instead of 
derivatives. 

Additionally, the above disclosure is not intended to limit the present 
invention. The scope of the present invention is defined by the appended claims. 
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